The BigDAWG Architecture

Authors

  • Vijay Gadepally
  • Jennie Duggan
  • Aaron J. Elmore
  • Jeremy Kepner
  • Samuel Madden
  • Timothy G. Mattson
  • Michael Stonebraker

Abstract

BigDAWG is a polystore system designed for complex problems that naturally span different processing or storage engines. Its architecture supports diverse database systems with different data models, supports the competing notions of location transparency and semantic completeness through islands of information, and includes a middleware that provides a uniform multi-island interface. In this article, we describe the current architecture of BigDAWG, its application to the MIMIC II medical dataset, and our plans for the mechanics of cross-system queries. During the presentation, we will also give a brief demonstration of the current version of BigDAWG.

Similar resources

BigDAWG Polystore Release and Demonstration

The Intel Science and Technology Center for Big Data is developing a reference implementation of a Polystore database. The BigDAWG (Big Data Working Group) system supports “many sizes” of database engines, multiple programming languages and complex analytics for a variety of workloads. Our recent efforts include application of BigDAWG to an ocean metagenomics problem and containerization of Big...

CSCI 2980 Project Report Data Migration from S-Store to BigDAWG

Since spring 2016, I've been working with Prof. Stan Zdonik on a project about data migration from S-Store to the BigDAWG polystore system. S-Store, which is built on top of H-Store, is the world's first transactional streaming database system. S-Store maintains all the transactional support of a traditional relational database, while it supports streaming processing which is needed in the real-time ap...

Demonstrating the BigDAWG Polystore System for Ocean Metagenomics Analysis

In most Big Data applications, the data is heterogeneous. As we have been arguing in a series of papers, storage engines should be well suited to the data they hold. Therefore, a system supporting Big Data applications should be able to expose multiple storage engines through a single interface. We call such systems polystore systems. Our reference implementation of the polystore concept is ca...

A Demonstration of the BigDAWG Polystore System

This paper presents BigDAWG, a reference implementation of a new architecture for “Big Data” applications. Such applications not only call for large-scale analytics, but also for real-time streaming support, smaller analytics at interactive speeds, data visualization, and cross-storage-system queries. Guided by the principle that “one size does not fit all”, we build on top of a variety of stor...

Data Ingestion for the Connected World

In this paper, we argue that in many “Big Data” applications, getting data into the system correctly and at scale via traditional ETL (Extract, Transform, and Load) processes is a fundamental roadblock to being able to perform timely analytics or make real-time decisions. The best way to address this problem is to build a new architecture for ETL which takes advantage of the push-based nature o...

Journal title:
  • CoRR

Volume abs/1602.08791

Publication date: 2016